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AMENDMENTS TO THE CLAIMS: 

1-48 (Cancelled) 

49. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

Accessin g obtaining r&aaeie content in a first language of a web site hosted by a web 
server through a publicly accessible path on the hitemet by crawling [[a]] tiie web site via 
following links to additional pages; 

parsing the obtained content in the first language into one or more translatable 
components; 

determining identifying whether there e xists at least one of the translatable components 
that does not have a corresponding translated component in a second language : and 

generating statistics based only on the at least one translatable component that does not 
have a corresponding translated component in the second language in order to estimate the 
amount of work involved in size the content for language translation firom the first language to 
the second language . 

50. (Canceled) 

5 1 . (Previously presented) The method according to claim 50, wherein the language 
translation includes human translating the one or more translatable components. 

52. (Previously presented) The method according to claim 49, further comprising the 
step of adding the one or more translatable components to a translation list for translation into a 
second language. 
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53. (Previously presented) The method according to claim 52, wherein: 

the first language includes one of EngUsh, French, Spanish, German, Portuguese, Italian, 
Chinese, Korean, and Arabic; 

the second language includes one of English, French, Spanish, German, Portuguese, 

Italian, Japanese, Chinese, Korean, and Arabic; and 

the second language is different from the first language. 

54. (Currently Amended) The method according to claim 49, wherein the step of 
acc e ssing obtaining includes retrieving the content in the first language firom a source in a first 
language. 

55. (Previously presented) The method according to claim 49, wherein each of the 
translatable components is one of: 

a text segment; 
an image file; 
an audio clip; 
a video clip; 
a file; and 

any combination thereof in an electronic data stream. 

56. (Previously presented) The method according to claim 49, further comprising the 
step of generating an identifier for each of the translatable components so that each of the 
translatable components is accessible via a corresponding identifier. 
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57. (Previously presented) The method according to claim 56, wherein the identifier for a 
text segment is generated using at least one of a hash code, a checksum, and a mathematical 
algorithm based on one or more text segments. 

58. (Previously presented) The method according to claim 49, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
coimt, a word count, and a unique word count. 

59. (Previously presented) The method according to claim 49, wherein the step of 
parsing is performed based on at least one markup tag contained in the content in the first 

language. 

60. (Currently Amended) A machine implemented method for providing statistics 
characterizing content, comprising the steps of: 

accessing r e mot e obtaining content in a first language of a web site hosted by a web 
server through a publicly accessible path on the Ltitemet by crawling [[a]] ^ web site via 
following links to additional pages; 

parsing the obtained content in the first language into one or more components; 

determining whether there e xists identifsdng at least one of the translatable components 
that does not have a corresponding translated component in a second language ; and 

generating statistics based only on the at least one translatable component that does not 
have a corresponding translated component in the second language in order to estimate the 
amount of work involved in size th e cont e nt for language translation fi-om the first language to 
the second language . 
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61 . (Currently Amended) A system for providing statistics characterizing content for 
language translation, comprising: 

a content accessing unit configured for accessing remot e obtaining content in a first 
language of a web site hosted by a web server through a publicly accessible path on the hitemet 
by crawling [[a]] ttie web site via following links to additional pages; 

an information processing unit configured for parsing the content in the first language 
into one or more translatable components; 

a n identifying d e t e rmining unit configured for identifying determining wheth e r th e r e 
exists at least one of the translatable components that does not have a corresponding translated 
component in the second language ; and 

a statistics generation unit configured for generating statistics based only on the at least 
one translatable component that does not have a corresponding translated component in the 
second language in order to estimate the amount of work involved in siz e th e content for 
language translation firom the first language to the second language . 

62. (Canceled) 

63. (Previously presented) The system according to claim 62, wherein the language 
translation includes human translating the one or more translatable components. 

64. (Previously Presented) The system according to claim 61, wherein the content 
accessing unit is further configured for facilitating retrieval of content firom a data source in the 
first language. 

65. (Previously presented) The system according to claim 61, wherein each of the 
translatable components is one of: 
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a text segment; 
an image file; 
an audio clip; 
a video clip; 

a file; and 

any combination thereof in an electronic data stream. 

66. (Previously presented) The system according to claim 61, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

67. (Previously presented) The system according to claim 61, wherein the first language 
includes one of Enghsh, French, Spanish, German, Portuguese, Italian, Chinese, Korean, and 
Arabic. 

68. (Previously Presented) The system according to claim 61, wherein the information 
processing unit parses the content based on at least one markup tag contained in the content. 

69. (Previously presented) The system according to claim 61 , further comprising a 
translation list to which the one or more components are added for translation. 

70. (Currently Amended) A machine readable medium having data stored thereon, the 
data, when read, causing the machine to perform the following: 

acc e ssing r e mot e obtaining content in a first language of a web site hosted by a web 
server through a pubUclv accessible path on the Internet by crawling [[a]] tiie web site via 
following links to additional pages; 
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parsing the obtained content in the first language into one or more translatable 

components; 

d e t e rmining wh e ther th e r e e xists identifying at least one of the translatable components 
that does not have a corresponding translated component in a second language : and 

generating statistics based only on the at least one translatable component that does not 
have a corresponding translated componen t in the second language in order to estimate the 
amount of work involved in siz e th e content for language translation from the first language to 
the second language . 

71. (Canceled) 

72. (Currently Amended) The medium according to claim [[71]] 70, wherein the 
language translation includes human translating the one or more translatable components. 

73. (Previously presented) The medium according to claim 70, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

74. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing remote obtaining content in a first language of a web site hosted by a web 
server through a pubhcly accessible path on the Internet by crawling [[a]] tiie web site via 
following links to additional pages; 

parsing the obtained content in the first language into one or more translatable 
components based on markup tags contained therein; 
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determining whether there e xists identifying at least one of the translatable components 
that does not have a corresponding translated component in a second language ; and 

generating statistics based only on the at least one translatable component that does not 
have a corresponding translated component in the second language in order to estimate the 

amount of work involved in size the cont e nt for language translation from the first language to 
the second language, wherein the at least one translatable component is to be human translated. 

75. (Previously presented) The method according to claim 74, further comprising the 
steps of: 

obtaining tracking information corresponding to a source from where the content is 
retrieved; and 

utilizing the tracking information when additional content is to be retrieved from the 

source. 

76. (Previously presented) The method according to claim 75, wherein the tracking 
information includes at least one of state information or session information. 

77. (Previously presented) The method according to claim 76, wherein the tracking 
information is obtained via a cookie. 

78. (Previously Presented) The method according to claim 74, wherein when the content 
contains an electronic form, the method further comprising the steps of: 

populating the electronic form automatically with pre-defined information; and 
submitting the populated electronic form to the source. 

8 



Application No.: 10/784,334 

79. (Previously presented) The method according to claim 74, wherein the source from 

where the content is retrieved is a web site. 

80. (Previously presented) The method according to claim 74, further comprising: 
identifying an image file associated with the content in the first language as a translatable 

component; 

generating statistics for the image file. 

81. (Previously presented) The method according to claim 80, wherein the statistics for 
the image file is generated manually. 

82. (Previously presented) The method according to claim 80, wherein the statistics for 
the image file is generated automatically. 

83. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing remot e obtaining content in a first language of a web site hosted by a web 
server through a pubhclv accessible path on the Internet by crawling [[a]] tiie web site via 
following links to additional pages; 

parsing the obtained content in the first language into one or more translatable 
components; 

determining wh e ther th e r e e xists identifying at least one of the translatable components 
that does not have a corresponding translated component in a second language : and 

generating statistics based only on the at least one translatable component that does not 
have a corresponding translated component in the second language in order to estimate the 
amount of work involved in siz e the content for language translation from the first language to 
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the second language 




-to measure the size of a portion of t he 



content in the first language for language translation. 

84. (Currently Amended) A system for providing statistics characterizing content for 
language translation, comprising: 

a content accessing unit configured for accessing remot e obtaining content in a first 
language of a web site hosted by a web server through a publicly accessible path on the Internet 
by crawling [[a]] the web site via following links to additional pages; 

an information processing unit configured for parsing the obtained content in the first 
language into one or more translatable components; 



a n identifying d e t e rmining unit configured for identifying d e termining wheth e r th e r e 
€xists 3X Icd-St one of the translatable components that does not have a corresponding translated 
component in a second language ; and 

a statistics generation unit configured for generating statistics based only on the at lest 
one translatable component that does not have a corresponding translated component in the 
second language in order to estimate the amount of work involved in siz e th e cont e nt for 
language translation from the first language to the second language , wher e in th e statistics ar e 
ttsed to measure the size of a portion of the content in the first language for language translation. 

85. (Currently Amended) A machine readable medium having data stored thereon, the 
data, when read, causmg the machine to perform the following: 



server through a publicly accessible path on the Internet by crawling [[a]] tiie web site via 
following links to additional pages; 




obtaining content in a first language of a web site hosted by a web 
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parsing the obtained content in the first language into one or more translatable 

components; 

determining whether th e re e xiots identifying at least one of the translatable components 
that does not have a corresponding translated component in a second language : and 

generating statistics based only on the at least one translatable component that does not 
have a corresponding translated component in the second language in order to estimate the 
amount of work involved in size th e content for language translation, wherein the statistics are 
used to measure the size of a portion of the content in the first language for language translation. 
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